590 research outputs found

    CUDA programs for solving the time-dependent dipolar Gross-Pitaevskii equation in an anisotropic trap

    Full text link
    In this paper we present new versions of previously published numerical programs for solving the dipolar Gross-Pitaevskii (GP) equation including the contact interaction in two and three spatial dimensions in imaginary and in real time, yielding both stationary and non-stationary solutions. New versions of programs were developed using CUDA toolkit and can make use of Nvidia GPU devices. The algorithm used is the same split-step semi-implicit Crank-Nicolson method as in the previous version (R. Kishor Kumar et al., Comput. Phys. Commun. 195, 117 (2015)), which is here implemented as a series of CUDA kernels that compute the solution on the GPU. In addition, the Fast Fourier Transform (FFT) library used in the previous version is replaced by cuFFT library, which works on CUDA-enabled GPUs. We present speedup test results obtained using new versions of programs and demonstrate an average speedup of 12 to 25, depending on the program and input size.Comment: 7 pages, 2 figures; to download the programs, click other formats and download the sourc

    Leveraging Interdisciplinary Education Toward Securing the Future of Connected Health Research in Europe: Qualitative Study

    Get PDF
    Background: Connected health (CH) technologies have resulted in a paradigm shift, moving health care steadily toward a more patient-centered delivery approach. CH requires a broad range of disciplinary expertise from across the spectrum to work in a cohesive and productive way. Building this interdisciplinary relationship at an earlier stage of career development may nurture and accelerate the CH developments and innovations required for future health care. Objective: This study aimed to explore the perceptions of interdisciplinary CH researchers regarding the design and delivery of an interdisciplinary education (IDE) module for disciplines currently engaged in CH research (engineers, computer scientists, health care practitioners, and policy makers). This study also investigated whether this module should be delivered as a taught component of an undergraduate, master’s, or doctoral program to facilitate the development of interdisciplinary learning. Methods: A qualitative, cross-institutional, multistage research approach was adopted, which involved a background study of fundamental concepts, individual interviews with CH researchers in Greece (n=9), and two structured group feedback sessions with CH researchers in Ireland (n=10/16). Thematic analysis was used to identify the themes emerging from the interviews and structured group feedback sessions. Results: A total of two sets of findings emerged from the data. In the first instance, challenges to interdisciplinary work were identified, including communication challenges, divergent awareness of state-of-the-art CH technologies across disciplines, and cultural resistance to interdisciplinarity. The second set of findings were related to the design for interdisciplinarity. In this regard, the need to link research and education with real-world practice emerged as a key design concern. Positioning within the program context was also considered to be important with a need to balance early intervention to embed integration with later repeat interventions that maximize opportunities to share skills and experiences. Conclusions: The authors raise and address challenges to interdisciplinary program design for CH based on an abductive approach combining interdisciplinary and interprofessional education literature and the collection of qualitative data. This recipe approach for interdisciplinary design offers guidelines for policy makers, educators, and innovators in the CH space. Gaining insight from CH researchers regarding the development of an IDE module has offered the designers a novel insight regarding the curriculum, timing, delivery, and potential challenges that may be encountered

    Tailor: Altering Skip Connections for Resource-Efficient Inference

    Full text link
    Deep neural networks use skip connections to improve training convergence. However, these skip connections are costly in hardware, requiring extra buffers and increasing on- and off-chip memory utilization and bandwidth requirements. In this paper, we show that skip connections can be optimized for hardware when tackled with a hardware-software codesign approach. We argue that while a network's skip connections are needed for the network to learn, they can later be removed or shortened to provide a more hardware efficient implementation with minimal to no accuracy loss. We introduce Tailor, a codesign tool whose hardware-aware training algorithm gradually removes or shortens a fully trained network's skip connections to lower their hardware cost. Tailor improves resource utilization by up to 34% for BRAMs, 13% for FFs, and 16% for LUTs for on-chip, dataflow-style architectures. Tailor increases performance by 30% and reduces memory bandwidth by 45% for a 2D processing element array architecture

    Zuclopenthixol decanoate in pregnancy: Successful outcomes in two consecutive off springs of the same mother

    Get PDF
    Introduction. Almost all individual antipsychotics are classified into the intermediate pregnancy risk category as no or limited data exist about human pregnancy outcomes. We presented the case of zuclopenthixol decanoate using in two successive pregnancies of the same woman, which had not been published in the available peer-reviewed literature. Case report. A middle-age female subject who suffered from schizophrenia received zuclopenthixol decanoate injection during her two consecutive pregnancies. About four and a half months before diagnosis of the first pregnancy (~3.5 years after psychosis emergence), zuclopenthixol decanoate (400 mg every other week, im injection) was introduced to the treatment protocol (due to previous non-compliance with halo-peridol and risperidone). A significant clinical improvement was achieved and the dose during pregnancy was reduced to 200 mg once monthly and maintained to date. In both pregnancies the women gave birth to healthy girls who have been developing normally until now, at their ages of 6 months and of 3.5 years. During pregnancy and after giving birth to children the mothers' psychiatric status and her social functioning were significantly improved and are still stable. Close monitoring of the mother's health, a multidisciplinary approach to both her treatment and the monitoring of pregnancies as well as the complete compliance with the prescribed drug protocol were likely to be crucial for the therapeutic success. Conclusion. A favorable outcome of the present case suggests that the zuclopenthixol decanoate is a rational therapeutic option for pregnant women suffering from psychosis when the expected benefit exceed the potential risk, but a definitive evidence for its safety requires large, controlled studies

    Automatic heterogeneous quantization of deep neural networks for low-latency inference on the edge for particle detectors

    Full text link
    Although the quest for more accurate solutions is pushing deep learning research towards larger and more complex algorithms, edge devices demand efficient inference and therefore reduction in model size, latency and energy consumption. One technique to limit model size is quantization, which implies using fewer bits to represent weights and biases. Such an approach usually results in a decline in performance. Here, we introduce a method for designing optimally heterogeneously quantized versions of deep neural network models for minimum-energy, high-accuracy, nanosecond inference and fully automated deployment on chip. With a per-layer, per-parameter type automatic quantization procedure, sampling from a wide range of quantizers, model energy consumption and size are minimized while high accuracy is maintained. This is crucial for the event selection procedure in proton-proton collisions at the CERN Large Hadron Collider, where resources are strictly limited and a latency of O(1) μ{\mathcal O}(1)~\mus is required. Nanosecond inference and a resource consumption reduced by a factor of 50 when implemented on field-programmable gate array hardware are achieved
    corecore